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(57) Abstract 

DNAs represented by Sequence Listings (1 and 2) and a process for producing proteins which comprises inserting these DNAs into 
expression vectors to thereby produce proteins having molecular weights of about 60 kD (under reductive conditions) and about 60 kD and 
about 120 kD (under nonreductive conditions) and being capable of inhibiting the formation of osteoclasts. These proteins are useful in the 
treatment of osteoporosis and rheumatism. 
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i&mirz-y-j ht>>f yt it, mmmmmmm^-y r i u -(fibroblast grow 

th factor ; FGF : Rodan S. B. et al., Endocrinology vol. 121, pl917. 1987). 

y~>a. U ymigMm^- [(insulin like growth factor-I ; IGF - I : Hock 
J. M. etal., Endocrinology vol. 122. p254. 1988). << > > a. U >^it$iiK 
■T--N (IGF -II: McCarthy T. et al., Endocrinology vol.124, p301, 198 
9). T ? t£> A(Activin A ; CentrellaM. et al., Mol. Cell. Biol. vol. 
11, p250, 1991). h => yxv i >ri#5j6H^- £ (transforming growth f 
actor-/3 ; Noda M. , The Bone. vol. 2. p29, 1988) . '<X*- ^ a h a b° XVas 
culotropin ; Varonique M. et al.. Biochem. Biophys. Res. Commun. vol. 19 
9, p380, 1994). &C/WJr# J^fiScHT-:? t i U-(bone morphogenic protein ; 
BMP : BMP-2 ; Yamaguchi. A et al., J. Cell Biol. vol. 113, p682. 1991, 
OP-1 ; Sarapath T. K. et al.. J. Biol. Chem. vol. 267, p20532. 1992, Knut 
sen R. et al., Biochem. Biophys. Res. Commun. vol.194. pl352. 1993) W£(D 

htt^ytLXiZ, b^y^yyt-ly^^Mm^-B (transforming growth 
factor- & ; Chenu C. et al. , Proc. Natl. Acad. Sci. USA, vol.85, p5683, 

1988)^^ >^-o^ + >- 4 (interleukin-4 ; Kasano K. et al. , Bone-Mine 
r., vol. 21, P179, 1993)^fg^£nTt^ 0 X. 5£#$fflflSc: X 3#TO£Jfp 
ft\-?Z>-y-J b t)4 y t LTIi. tih'y h — Xcalcitonin ; Bone-Miner., vol.1 
7, p347. 1992 ). v ? n ~3 T — V a □ ^.-SMB^P (macrophage colony- 
stimulating factor; Hattersley G. et al. J. Cell. Physiol, vol.137. pl99, 
1988) .-T >^-a>f + >-4(Watanabe, K. et al., Biochem. Biophys. Res. 
Commun. vol. 172, pl035. 1990). M^>^-7id> - r (interferon- 7 ; 
Gowen M. et al., J. Bone Miner. Res., vol. 1, p469, 1986)^^181^ £ tlT I ^ 
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1 u-o-^-r ht>J>m, ±i2cD"tj->r h# y >©— ^(::o^T(i#ttijejf##ji: 

LT^^^^nsffi^tiT^^o x. *^^h->n %®.m&<Dfemm, mm 
(Dmmftmiknxi^tzo 

MI MR -90 (ATCC^I£-g:f€#^ CCL186)©i^«^(r^#teflS^gQjf|I^J^fL 
W^JK#Jftflia^filc«lM»14«:W"rSSfi»0CIF«-l,aj It^S (PCT/JP96/00374 

o c©sK#aiiia^wi«'j©tt*w-rssa«oc[FcDa^(c^^T. ££>ti£& 
fe&(D$>&m&w&%$L-fz>z. ttDx^sz^? ^-£m&u cn^ffl^rafE 
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(a) #i=-*(SDS-PAGE ; 
(i) iI^&#T-C&60kD 

(ii) ^^^c^#TT^60kDS^120kD 

(b) T ^ y^SS^iJ; 

(C) £^14 ; 
(d) m^^tt ; 

(i) 70°C\ I0frffl£ ^{i56°C. 305>^0>JlDMS(l cfc 0 ®[#&a0&©#ft • j£B 
(ii) 90°C. lOftfflczm&MmizJ: <0W.^mmo)ftit • (R«iffii*rj^tt^ffet>nSo 

3fU ^Sfe^J4(iii) (C&tf^ ^^VADNA^Mbtllbn/: 

ti^? ^-pwesr aociF^h7>x7x^ FU; cos7 m^m±m mmm 4 

(in)). 3 ti^? ^-pWESR fl^F7>X7xnU; C0S7 fcBJ&f£*±fi| (*f 
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ti, t h flu 31 yV ADNAt^X; FX^-^ffl^TnX; K?^^ U 
fESU 'J -£0CIFc DNA^^Cf^UDNAfi^^o- 

M^^^sociF^^it-r^c t^t-§6o #«=>n/c«K#»i!ia^fiJcffli©jsi*«-w-r 
Hii l 

^ K5^7*5 'J -CDftUK 
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t hl^fyy ADNAC? d- y-r-y 9 ft ■ Cat. No. 6550-2) irpWE 15 ^Xl K^7 

jnzm^xmrnLtzfr^ dna . *n§Sk y T-^^m.o-mmyjmu. Moiecuia 

r Cloning : A Laboratory ManuaKCold Spring Harbor Laboratory(1989))£-# 
(i) b h • f J A DNA ffll%^t#P=%)c£>tilg 

1.5ml CbJLys<y FJl7?-^--J°4^ (f a-7"A . B . C . DXciOmM Tris- 
HC1 . lOmM MgCl 2 . lOOraM NaCl^fr^ 750 wl C^^l/cl: M^l^VA 
DNA£-etl^:ft lOO/zgAtU ^a.—fk iZit 0. 2 «y K KteO. 4 

K f-a-^C (rii 0. 6^L— >y K ^-7'D 0. 8^l— y h<Dpmm 

mUbol&mU LT 1 Bf mm<tLtc 0 Ztl^'tlcD?- *-7~lZ2QwMlZteZ> £ 

-5 tCEDTA^^fiD LTStC^±d6. 7x7-;l/?DD*iA (1:1) "CI* 
HSU 2fg»CDx^ y -A-£jfjnx_TDNA ^Zti^^1t^ 0 il4>#*tTDNA £ 

miRL^^t. io%3L? j ^ n-en©f a -^©^©dna^ ioo/zi 

© TE d^8?L/Co 4^©f-^-7*©DNA £l ^C^tfc, 68°C(IT105>1*S L 

-n^ii^W (38 ml)©^-CfpS3 tfcl0%-40%ilj^^-> a 
«I^S^Se(CfiJg L/Co ^ 3«i^S^S2li20iiiM Tris-HCl(pH8.0). 5mM EDTA> 1M 
NaCl Z^tfMffiM&tSifrTftmLtzo -Oii>i:>W^QAt^^f5fSRP28SA a-^ 
-£ffl^T20°Cr26, OOOrpm (rT24B#r Q 1ii>^ U/c© 77^3>^u^-^ 
ffll^T^ 3ffi®m*)m%:0. 4ml fo©77^ 3 >C^gU;o 3-7^?^3> 
©^^0.4 %Ttfu -XSm^UKC^ftTDNA ©-tr >f X*»g b fcj; 
-£30kb (^o^-x^T) A>«^40kbCDg^CDDNA te^ti? *7 s > «g 
iRJt^lO^J^Tt-^S <fc -9 TET'^fJLfctD^x^y 2. 5f§ftJ!lDX.TDNA £ 
ftl^-tir/co DNA (i TE (lOmM HC1 (pH8. 0) + ImM EDTAISM OsJT TE 
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£ttfc:/n h 3-;Mr^oT$«S?i?BamHI C «fc oT5££?gffc L/C©*., jl 9 J 
-JlttmiZ £ oTDNA ^rHJlRU lng/ml©agJg(I^: 5 «fc o TEKig# L/c 0 d ©DNA 

(iii) y; ADNA g)^^ g >%LKin vitrei *r— ! Ji/7 

1.5#g Ot^X^Slfcy y ADNA t3//g ©frJPSSliP? BamHTOgffcU/cpW 
E15 K^?*-£ 7 r^vi^T #©Ready-To-Go T4DNA 7-ry'-X^ffll> 

T20//1 ©St&Sffi^T^-T y-^a > L/io 3 > L/cDNA 4^*77 

-^i:lotin vitro^>yy->>>rLfc 0 Steffi©— 
SB«-SM»«teT*iS:F»W{r^b, lOmM MgCladSg® L/cTvU^XLl-Blue MR(X h 
5^^— >tfc) tl^bt7 7- ^*i^^it/;0^ > 50 ju g/ml © 7 > b° :> 'J > 

±IB©^^T^L^^°«y>7--^>rSn£:^i:^:Jl®XLl-Blue MR £g£ U 
figl5cmCDT#"o-x:7U- h^fc 0 50. 000^@©=J nx-^^f |, J; -9 (IT > t° 

„ 7"U- h— tfc^/c03ml <DLmi&&m?LTXWjWi(Dzi az-^ga U 0JR L 
/c 0 7#u-x:/U- h££ £>Cl3ml ©LBfgff&Tl HJ#fct\ c ft£X©7v:fl§^ 

mizZkbb^ 7mza-jl&20%£teZ>J:?lzffitaL. £ bC7 > t: y > £-50 
#g/mL<h/<£<5 J;-5(;:jfjnx_7c 0 -^^L/c©^— g|3£#lKU ^0 £-80°CC:f£ 
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nmm 2 

50x/g/ml©T > f :> »; >^-g-trEgl5cm©LBT^o-X^U- h (Ijg^H. 2cm 
/c^50, OOOflSI ©^Biffin n^-^^-T^ck -5 ^ K7^^7U-*il % 37 

ffll^TDNA ^-Fn-fe^D-X7^^-Jii:S^L/; 0 ^bC:07^;^- 
*«EE^--^ >4'-e80 o CT-2 8^KJfjp|»iL/c 0 c: CO J: -5 C&S Lfc— h a -tr - 
X7^^-^ t hOCIFcDNA©5' ;fcifls<i:3' 5M8^£ftMKLfc: 2SEODNA £:/a 
-T'tlTM^'J^XU:,, fiP*>. 0CIFcDNA£3"fr*:l&® pBK/OIFIO GM 
^H^XSS^P^^^rX^X^S#jW%Eff^I€, ^l£#^-FERM BP-5267) J: t> ^ 

K£1flSS!U OCIFcDNA^ir^X * K£#JPSS^KpnI irEcoRI T7NHtL 
. tf^77^7 > h£T#n-;*Y;U£m^T#flt Ufc©5. 0. 2kb ©Kpnl/E 
coRI^5 > h£QIAEX XW^s.^t, h7?i/s> + 7 h ( + 7f>|t) 
l^TfifjRL-^o -©DNA £ ^ ff'f-y -4 ADNA 7^'J>y^XfA (Tv~> + A*± 
SD £ffll^T 32 P "C«»b/i (5'-DNAya-y) 0 SfcglJc:, flbtlfc^X; 
K^flJRRim BamHltftJPfil?^ BcoRVTfSffc: LT£-f '30. 2kb OBamHI/EcoRV 7 

> hfcl^lKflfiU _h§8©73&-e 32 P Igf&L/c (3'-DNAy°o-77) 0 ± 
Ifiuyj * 7 4 )19-<Do — ft£5' -DNAT/n-:/^ SO©— tfc£3' ~DNA7°a — 
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DNA Xfifc cfc^38kbTfeo/c 0 

fgJ»J3 

(i) ocip a dna g>-»y^p-^>y 

KpWE0CIF£#J&BI^EcoRl£m^Tf8ffcU £ bfc 7 5 r ^ > h £ 0.7 

-T n >flg (Hybond-N, Tv i/ v Atfc) U XF7^'J>*- (xF?^^- 
>#) ^-ffl^TDNA «-^>r n >mizm?£Ltz 0 — ^T. ^^X^ KpBKOCIF ^$IJPS 
SI^EcoRI K<fcoTflHfcU t hOCIFcDNA£#tfl. 6kb <D777* > h£T#n 
-xy^^-m^T^SIL^©^. ^ if 7° ^7 4 A DNA 7^'J>r^XrA (7vi/ 

6kb ©OCIFcDNA^^-ryj y>T X^1±/"c^. 6kb . 4kb „ 3. 6kb . 2. 6kb©DNA 
7 ? 7* > F^^7''J kfrt>fr~>fco t M)CIFcDNAt'W:7ij y 

npBluescript II SK+^?^- (XHr;->|t) ©EcoRI if-T h KfiTfec:^ 

•jW^d-^^u n^titzT'^xi K^^n^npBSE6 N p bse4 „ pbse 

3.6 „ pBSE2. 6 tft% L/co 

-hlS^^Xi YlZ-V-7 7 a-~>7&tltzt hOCIF^VADNA CDJkKBE?iJCD#i 
^ClliABI 7 A •r-tlri; 9- i ^ - 9 J 9 - 9 i; y y v ? ^ - \) f 
?i; 3 yjf y \. £373 >- 9 x > > A (7^ 
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1 0 

7^F^ti/XfAXtt) %ri£mLtz 0 iSSK^^ffi^v^v-ti t bOCIF 
cDNAcDJMSie^iJ (@B?im@2?iJ3Mf4 ) .fci^J&L^o £7^ f&g£K3W&/£ 

nmm^m^ i s^2i:^t 0 ge^j#^- 1 citiociFiffcT-oig nr; >#^£ 

COS - 7M(;:££ffl^&;tg!J0CIFg>4£jg| 
(i) QCIFy; ADNAM^X; H'CDfP^ 

OCIF^V ADNA£l&M)l&T^£^3/cd6tl. a^i K^<? ^- P WE15(X 
Y^y- 9 iS-yft.) [Z.^m.'f'y^l KpcDL-SR a296(Molecular and Cellar Bio 
logy, vol.8, P466-472, 1988)©2i§f& a.- y h A L/co ST. ^I^^^X^ 
KpcDL-SR a296 £flJKHfJfSal ITTSft LTSRa :/o ^- ^ SV40*£8Ji* ^ 

^/n-XSt*iCJ;ot^l^ QIAEX >^> y h (4-- 

X'ffiltL^ Tifn-xm%ifcWHZ&^Xftm&. 8.2kb CDpWE15 DNA £QIAEX II 

DNAlrH-«^DNA7'7>f^>^>^ (Siiltt) 

^M^X; KpWESR a&*T?yii^l* ( + 7y>*t) ^ffi^tftSLfco 

Iff ISO) T#^n/^^38kbcD0CIFyy ADNA^#P A^fc^X S KpWEOCIF 
£$JFgglitNo tlT^fb LT &j38kb©0C IFfy ADNA^OJUU T # o - x E 
^aiCiot^llt^ QIAEX II^x?^h7^^3>+9 h ( + ry>*t) 
^m^xmULtz 0 — KpWESR a£f|iiJIW«EcoRI U 7 



WO 98/07840 



PCT/JP97/02859 



1 1 

pmmmEcoRl T^Hfc^n KpWESR tEcoRI-Xmnl-Notl T^-7^-(#H05. #1 
156 ; — a->f K'^^^tt) £T4 DNA'J#--t? (^©itft) £ffl<^ 

T*g£U 7^a-^t^»i:J:oT7'J-0TW-t7}i^ QIAEX II 

T^fb^n/c^STkbcDOCIF^V ADNAtry^^-^ftJjnb^pWESR a £T4 D 
NA U #--tf (^fiigft) £flR*T*g£U >y ? [I'^y y >f ^ > 7'i ? X h 

=>>? v (x h^-^^->tt) ^mi^r^f >^ h da'^^ x 

H§^XL1-Blue MR (Xh7-^^->tt) C^^^1tfc 0 f# £>n/^W3fc&**£ 
WMtS-e. OCIFyy ADNA*<#A£ttfc5gm:3X i KpWESR aOCIF€-*Ty> 
(^T^Wl) £fflt>TfflfS!l>fco 0CIF3BS=3X* KpWESR aOCIF^i* 

(ii)QCIFy J ADNAg) K 5 > h tt3B%;RcmiFg1*feg>flllt% 

BuIS(i) T:f#^n^OCIF|g^nx 5 KpWESR aOClP£fflt^T, JKiTfwiB^£7j 
&T*ft*»*.S0CIF£38«,<*-ti\ -e©?£t££ffliJ/£ U/c 0 8 xio 5 flil©C0S-7 
(3ffc^»F£f!ff*M&BHS6aRfi\ RCB0539)£ 6 ^xi^l/- h£D#^ x ;Kilo«4^ 

IB, WcgL &Jfa^DMEMigi&-eM£-#fc^U/c 0 h5> 

I FIB Jg, 3X5 KpWESR aOCIFt'J^^x^^ i 

•7x7KOtel!&cJ[inx.^o *H&i: ITM^X * KpWESR a^fflL\ iRBHSKl^liK 

m^tz 0 ffl^f^x; kdnasc/'J x ? * * > ©fin -en-en 3 /zg so' 

12// 1 Xb-otzo 1.5ml ©§r Ll^EX-CELL301i£ife (JRH 

^-ilDX.. ^ ^{C48Bfra1^, *&«j£iIIt|XU cn^OCIF^ 
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MM Vol.34. p999 (1989)) MTakahashi N. et.alcD^&CEndocrihology vol 
.122. pl373 (1988)) (_3_^ffl!l3£ L/_ 0 tk&tfana V X#MM2t^ h<D%iVk 

COCIF) (Z>j_14_bfc 0 irtlt>t>, 96^ x ^7 >f ? □ :/ u - h cd£.h7_.;M_. 2 
X10' 8 2 ©7g1_l3 hf* ^ >D 3 tlO%^!^stm^:^t:a - MEMmm 
BRLtfc) T^-^L/c-t>^;U 100 ^ 1 £AfU ^170OvOXtW8a3 

xio 5 m% loo u\ mo%^te!%Mm&$tta - MEMt&mizmm&'&xftnL^ 

5% C0 2 . 37°C. Mm. 100%(lT-iSH±^#L/c o 3 B 0 t 5 B B fr. f« 
ffi©-5*> 160 ^ 1 £0S§gU 1 xlO- 8 M^ttM fc"* * >D 3 &O10%^BSJ£_uf 
£^fra -ME M^i_Ti£«RL7^ 160//] L/c 0 *S«7B^{C«U 

H_£"> ym&mw&m&mTk-cfofrLtz&x.? ; -ji/T-b h> (i : 1 ) 

>y KAcid Phosphatase. Leucocyte. Cat. No. 387-A ; ^7"v?±) £fflWc$fe&T' 
t&ffl L/cc fi:HBfc#£TT*CDi&t_*;*:7 7- ^--tfffi14»14ilfflll&©«^^0CIFS14 
tU: a 1 (C^-To CCD*Sm«fc*9. [MR-90<£>±£^» ^n/*3^^M 

COS - 7«'M^/:»fiDOC I F fi§f_ 
f&ffi^ 1/10 1/20 1/40 1/80 1/160 1/320 

OCIF genomic DNA#A ++ ++ ++ ++ + 

^?9-mx ______ 

______ 
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1 3 

est-. + + itwnmmmm<s o %m±nm^n^m^^ +itmnmmr^ 
#3 o-8 o %mu&tiz>mn&^ -temmm&ztiKi^ t%7jkir 0 ) 

±IS(ii)T#^tl/cOC[FS14M^ffl^> 1 10// 1 ©SDS-PAGE 

ffl-^>-/^^^y -7 y - (0.5M Tris-HCl , 20% ^'itn-^ 4% SDS. 20// 
g/il7"DA7x;-;l,7^-, pH 6.8) ^fiD^TlOO °CT3 #fSIi#;8& L/c© 

K7^7'D 7 f^ i^^-fy-j Kit) ^fflt^TSfiW^y^^^PVDF^ 

>7'U> (ProBlotU A'-+> X ;l,7-t) i:7'D. y f ^>rL/; 0 ^©^>:7' 
U y ft(::#^ft/cOCIFM&W£®^^l^-*+->y--tf 

T2 B#HSH£i£Lfc 0 ECL ^XrA (77i/ t Att) t^TfiLOCIFftf* 

^lTt^iei^ttjl/c 0 W> 1 ia(C^-TJ:-5(lpWESR aOCIF£h^>x 
^nU; C0S-7ijffl!}a©^»_L^^e(i, 120+ o ^ h y ^60+ o 

h >© 2 ;£©'<> K/^t&ai£n/c 0 — 7j\ pWESR a^<? ^-©<&£ h ^ >x 
hLfcCOS-7 »©^»±?t^[^#©7Jj*T)l?tlf L/c|§m. 120 
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£ FJf : 0*Si«Io( (irtr* 1 TB 1 ^3-^- 
$I€B : ¥fiK7^ 6 2 1 B 

(^gc7^ 6 £ 2 1 B^mmtZtl, ¥l&7¥ 1 0 £ 2 5 BtCT/y 

S'ft : FERM BP-5267 
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se?ij#-^ : 1 

IB^iJ©S$ : 1 3 1 6 

nm^m : MS? 

h # n i? - : W.mVt 

ffiPlcomW. : genomic DNA ( t h OCIFy J A D N A - 1 ) 

%m : 



CTGriAfiACAT 

\j I UUnUnvn I 


ATAACTTGAA 


CACTTGfiCCC 


TfiATHfinGAA 




Anr.r.AfTTTT 

/lUVJUrtU 1111 


60 


TCAGCCATCT 


GTAAACAATT 

VI M. i 1 M ft J 1 Vf i li 1 1 L 


TCAGTGGCAA 


CCCGCGAACT 


GTAATCCATG 


AATGGGACCA 


120 


CACTTTACAA 


GTCATCAAGT 


CTAACTTCTA 


GACCAGGGAA 

VJ M ■ \f \tf I 1 VI VI Vf 1 1 1 1 


TTAATGGGGG 

J. A t 11 1 1 VI VI VI VI 


AGACAGCGAA 

J 1 VI J 1 V/ i 1 VJ V/ vf /ill 


180 


CCCTAGAGCA 


AAGTGCCAAA 


CTTCTGTCGA 


TAGCTTGAGG 


CTAGTGGAAA 


GACCTCGAGG 

VJ J 1 Vr V/ 1 V/ VI * \ VI VI 


240 


AGGCTACTCC 


AGAAGTTCAG 


CGCGTAGGAA 


GCTCCGATAC 


CAATAGCCCT 


TTGATGATGG 

■1 1 VI 114 Vf f 1 A> VI VI 


300 


TGGGGTTGGT 


GAAGGGAACA 


GTGCTCCGCA 


AGGTTATCCC 


TGCCCCAGGC 


AGTCCAATTT 


360 


TCACTCTGCA 


GATTCTCTCT 


GGCTCTAACT 


ACCCCAGATA 


ACAAGGAGTG 


AATGCAGAAT 


420 


AUtAl/UuOl/ 1 


1 1 AobOl/tAA 


IlAUALAl I A 


GTTAGAAAAA 


TTCCTACTAC 


ATGGTTTATG 


480 


TAAACTTGAA 


GATGAATGAT 


TGCGAACTCC 


CCGAAAAGGG 


CTCAGACAAT 


GCCATGCATA 


540 


AAGAGGGGCC 


CTGTAATTTG 


AGGTTTCAGA 


ACCCGAAGTG 


AAGGGGTCAG 


GCAGCCGGGT 


600 


ACGGCGGAAA 


CTCACAGCTT 


TCGCCCAGCG 


AGAGGACAAA 


GGTCTGGGAC 


ACACTCCAAC 


660 


TGCGTCCGGA 


TCTTGGCTGG 


ATCGGACTCT 


CAGGGTGGAG 


GAGACACAAG 


CACAGCAGCT 


720 


GCCCAGCGTG 


TGCCCAGCCC 


TCCCACCGCT 


GGTCCCGGCT 


GCCAGGAGGC 


TGGCCGCTGG 


780 


CGGGAAGGGG 


CCGGGAAACC 


TCAGAGCCCC 


GCGGAGACAG 


CAGCCGCCTT 


GTTCCTCAGC 


840 


CCGGTGGCTT 


TTTTTTCCCC 


TGCTCTCCCA 


GGGGACAGAC 


ACCACCGCCC 


CACCCCTCAC 


900 


GCCCCACCTC 


CCTGGGGGAT 


CCTTTCCGCC 


CCAGCCCTGA 


AAGCGTTAAT 


CCTGGAGCTT 


960 


TCTGCACACC 


CCCCGACCGC 


TCCCGCCCAA 


GCTTCCTAAA 


AAAGAAAGGT 


GCAAAGTTTG 


1020 


GTCCAGGATA 


GAAAAATGAC 


TGATCAAAGG 


CAGGCGATAC 


TTCCTGTTGC 


CGGGACGCTA 


1080 


TATATAACGT 


GATGAGCGCA 


CGGGCTGCGG 


AGACGCACCG 


GAGCGCTCGC 


CCAGCCGCCG 


1140 
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CCTCCAAGCC CCTGAGGTTT CCGGGGACCA CA ATG AAC AAG TTG CTG TGC TGC 1193 

Met Asn Lys Leu Leu Cys Cys 
-20 -15 

GCG CTC GTG GTAAGTCCCT GGGCCAGCCG ACGGGTGCCC GGCGCCTGGG 1242 
Ala Leu Val 

GAGGCTGCTG CCACCTGGTC TCCCAACCTC CCAGCGGACC GGCGGGGAGA AGGCTCCACT 1302 

CGCTCCCTCC CAGG 1316 

WL$m^ : 2 
BE?il£>g£ : 9 8 9 8 

m<DM. : 2 

K?iJ©H^ : genomic DNA ( t h OCIF^r J A D N A - 2 ) 
@E?IJ : 

GCTTACTTTG TGCCAAATCT CATTAGGCTT AAGGTAATAC AGGACTTTGA GTCAAATGAT 60 
ACTGTTGCAC ATAAGAACAA ACCTATTTTC ATGCTAAGAT GATGCCACTG TGTTCCTTTC 120 
TCCTTCTAG TTT CTG GAC ATC TCC ATT AAG TGG ACC ACC CAG GAA ACG TTT 171 
Phe Leu Asp He Ser He Lys Trp Thr Thr Gin Glu Thr Phe 
-10 -5 1 

CCT CCA AAG TAC CTT CAT TAT GAC GAA GAA ACC TCT CAT CAG CTG TTG 219 
Pro Pro Lys Tyr Leu His Tyr Asp Glu Glu Thr Ser His Gin Leu Leu 
5 10 15 
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TGT GAC AAA TGT CCT CCT GGT ACC TAC CTA AAA CAA CAC TGT ACA GCA 267 

Cys Asp Lys Cys Pro Pro Gly Thr Tyr Leu Lys Gin His Cys Thr Ala 
20 25 30 35 

AAG TGG AAG ACC GTG TGC GCC CCT TGC CCT GAC CAC TAC TAC ACA GAC 315 
Lys Trp Lys Thr Val Cys Ala Pro Cys Pro Asp His Tyr Tyr Thr Asp 
40 45 50 

AGC TGG CAC ACC ACT GAC GAG TGT CTA TAC TGC AGC CCC GTG TGC AAG 363 
Ser Trp His Thr Ser Asp Glu Cys Leu Tyr Cys Ser Pro Val Cys Lys 
55 60 65 

GAG CTG CAG TAC GTC AAG CAG GAG TGC AAT CGC ACC CAC AAC CGC GTG 411 
Glu Leu Gin Tyr Val Lys Gin Glu Cys Asn Arg Thr His Asn Arg Val 
70 75 80 

TGC GAA TGC AAG GAA GGG CGC TAC CTT GAG ATA GAG TTC TGC TTG AAA 459 
Cys Glu Cys Lys Glu Gly Arg Tyr Leu Glu He Glu Phe Cys Leu Lys 
85 90 95 

CAT AGG AGC TGC CCT CCT GGA TTT GGA GTG GTG CAA GCT G GTACGTGTCA 509 
His Arg Ser Cys Pro Pro Gly Phe Gly Val Val Gin Ala 
100 105 110 

ATGTGCAGCA AAATTAATTA GGATCATGCA AAGTCAGATA GTTGTGACAG TTTAGGAGAA 569 
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CACTTTTGTT 


CTGATGACAT 


TATAGGATAG 


CAAATTGCAA 


AGGTAATGAA 


ACCTGCCAGG 


629 


TAGGTACTAT 


GTGTCTGGAG 


TGCTTCCAAA 


GGACCATTGC 


TCAGAGGAAT 


ACTTTGCCAC 


689 


TACAGGGCAA 


TTTAATGACA 


AATCTCAAAT 


GCAGCAAATT 


ATTCTCTCAT 


GAGATGCATG 


749 


ATGGTTTTTT 




TAAAGAAACA 


AACTCAAGTT 


GCACTATTGA 


TAGTTGATCT 


809 


ATACCTCTAT 


ATTTCACTTC 


AGCATGGACA 


CCTTCAAACT 


GCAGCACTTT 


TTGACAAACA 


869 


TCAGAAATGT 


TAATTTATAC 


CAAGAGAGTA 


ATTATGCTCA 


TATTAATGAG 


ACTCTGGAGT 


929 


GCTAACAATA 


AGCAGTTATA 


ATTAATTATG 


TAAAAAATGA 


GAATGGTGAG 


GGGAATTGCA 


989 


TTTCATTATT 


AAAAACAAGG 


CTAGTTCTTC 


CTTTAGCATG 


GGAGCTGAGT 


GTTTGGGAGG 


1049 


GTAAGGACTA 


TAGCAGAATC 


TCTTCAATGA 


GCTTATTCTT 


TATCTTAGAC 


AAAACAGATT 


1 109 


GTCAAGCCAA 


GAGCAAGCAC 


TTGCCTATAA 


ACCAAGTGCT 


TTCTCTTTTG 


CATTTTGAAC 


1169 


AGCATTGGTC 


AGGGCTCATG 


TGTATTGAAT 


CTTTTAAACC 


AGTAACCCAC 


GTTTTTTTTC 


1229 


TGCCACATTT 


GCGAAGCTTC 


AGTGCAGCCT 


ATAACTTTTC 


ATAGCTTGAG 


AAAATTAAGA 


1289 


GTATCCACTT 


ACTTAGATGG 


AAGAAGTAAT 


CAGTATAGAT 


TCTGATGACT 


CAGTTTGAAG 


1349 


CAGTGTTTCT 


CAACTGAAGC 


CCTGCTGATA 


TTTTAAGAAA 


TATCTGGATT 


CCTAGGCTGG 


1409 


ACTCCTTTTT 


GTGGGCAGCT 


GTCCTGCGCA 


TTGTAGAATT 


TTGGCAGCAC 


CCCTGGACTC 


1469 


TAGCCACTAG 


ATACCAATAG 


CAGTCCTTCC 


CCCATGTGAC 


AGCCAAAAAT 


GTCTTCAGAC 


1529 


ACTGTCAAAT 


GTCGCCAGGT 


GGCAAAATCA 


CTCCTGGTTG 


AGAACAGGGT 


CATCAATGCT 


1589 


AAGTATCTGT 


AACTATTTTA 


ACTCTCAAAA 


CTTGTGATAT 


ACAAAGTCTA 


AATTATTAGA 


1649 


CGACCAATAC 


TTTAGGTTTA 


AAGGCATACA 


AATGAAACAT 


TCAAAAATCA 


AAATCTATTC 


1709 


TGTTTCTCAA 


ATAGTGAATC 


TTATAAAATT 


AATCACAGAA 


GATGCAAATT 


GCATCAGAGT 


1769 


CCCTTAAAAT 


TCCTCTTCGT 


ATGAGTATTT 


GAGGGAGGAA 


TTGGTGATAG 


TTCCTACTTT 


1829 


PTATTf.P.ATn 

\j 1 n 1 luun 1 u 


P.TAPTTTPAP. 


APTPAAAAP.P 


TA APPTA APT 


TPTf TPTfTP 
lululuiulu 


tp appptppp 


1 QQQ 

iooy 


GGGTGTGGAA 


TCCCATCAGA 


TAAAAGCAAA 


TCCATGTAAT 


TCATTCAGTA 


AGTTGTATAT 


1949 


GTAGAAAAAT 


GAAAAGTGGG 


CTATGCAGCT 


TGGAAACTAG 


AGAATTTTGA 


AAAATAATGG 


2009 


AAATCACAAG 


GATCTTTCTT 


AAATAAGTAA 


GAAAATCTGT 


TTGTAGAATG 


AAGCAAGCAG 


2069 


GCAGCCAGAA 


GACTCAGAAC 


AAAAGTACAC 


ATTTTACTCT 


GTGTACACTG 


GCAGCACAGT 


2129 
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GGGATTTATT TACCTCTCCC TCCCTAAAAA 
AGGTTTCCAG CCCAAAGAGA AGGAAAGACT 
ACCGTTTTGT TGTTGCTGTT GCTGTTTTGA 
TACTTCATTC TGTTAATTCC TGTGGAATTA 
TAAAGCCAAA TTTCTCCATC ATTATAATTT 
TATTTCCACT GATAGTAATA AGGTAAAATC 
AAAGTACCAT CAGTTATAGA GGGAAGTCAT 
CTTCTGAATA TATTATGAAA CATTAGTTCT 
TTTAAAAGCT AACTTACCTA AAAGAAATAT 
AGGAGAAGAC CCAAGCCACA GATATGTATC 
ACGGTGGCTC ACATCTGTAA TCTCAAGAGT 
GGTCAGGAGT TCAAGACCAG CCTGGCCAAC 
AAAAATTAGC AGGGCATGGT GGTGCATGCC 
AGGAGAATCT CTTGAACCCT CGAGGCGGAG 
CTCCAGCCTG GGTGACAGAG ATGAGACTCC 
AAAAAGATTC TTCTTCATGC AGAACATACG 
TGTCCAAGTC ACTTATTTCG AGTAAATTAG 
AAATACCTCT GCTTATGATA TTGTAGAATT 
AGGATGTAGT AGGAAAGTAC TAAAAACAAA 
AAGGTGGTTC CTAAGATAAT GTCAGTGCAA 
TTTAGGCTGT GTTTTCCCCT CCTGTTCTTT 
GTCAATGAAT CATGTAGAAA GAGACAGGAG 
TTTTTTATTT TCTGGTTTTG GTAAAAGATA 
GAAGTTTAAT AAGTTTCTGT AGCTTTGATT 
GCCAGAATTG GCCTGTAAAA TCTACATATG 
TTACACTAGA TGGAGATATT TTCATATTCA 



CCCACACAGC GGTTCCTCTT GGGAAATAAG 2189 
ATGTGGTGTT ACTCTAAAAA GTATTTAATA 2249 
AATCAGATTG TCTCCTCTCC ATATTTTATT 2309 
CTTAGAGCAA GCATGGTGAA TTCTCAACTG 2369 
CACATTTTGC CTGGCAGGTT ATAATTTTTA 2429 
ATTACTTAGA TGGATAGATC TTTTTCATAA 2489 
GTTCATGTTC AGGAAGGTCA TTAGATAAAG 2549 
GTCATTCTTA GATTCTTTTT GTTAAATAAC 2609 
CTGACACATA TGAACTTCTC ATTAGGATGC 2669 
TGAAGAATGA ACAAGATTCT TAGGCCCGGC 2729 
TTGAGAGGTC AAGGCGGGCA GATCACCTGA 2789 
ATGATGAAAC CCTGCCTCTA CTAAAAATAC 2849 
TGCAACCCTA GCTACTCAGG AGGCTGAGAC 2909 
GTTGTGGTGA GCTGAGATCC CTCTACTGCA 2969 
GTCCCTGCCG CCGCCCCCGC CTTCCCCCCC 3029 
GCAGTCAACA AAGGGAGACC TGGGTCCAGG 3089 
CAATGAAAGA ATGCCATGGA ATCCCTGCCC 3149 
TGATATAGAG TTGTATCCCA TTTAAGGAGT 3209 
CACACAAACA GAAAACCCTC TTTGCTTTGT 3269 
TGCTGGAAAT AATATTTAAT ATGTGAAGGT 3329 
TTTTCTGCCA GCCCTTTGTC ATTTTTGCAG 3389 
ATGAAACTAG AACCAGTCCA TTTTGCCCCT 3449 
CAATGAGGTA GGAGGTTGAG ATTTATAAAT 3509 
TTTCTCTTTC ATATTTGTTA TCTTGCATAA 3569 
GATATTGAAG TCTAAATCTG TTCAACTAGC 3629 
GATACACTGG AATGTATGAT CTAGCCATGC 3689 
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GTAATATAGT 


CAAGTGTTTG 


AAGGTATTTA 


TTTTTAATAG 


CGTCTTTAGT 


TGTGGACTGG 


3749 


TTCAAGTTTT 


TCTGCCAATG 


ATTTCTTCAA 


ATTTATCAAA 


TATTTTTCCA 


TCATGAAGTA 


3809 


AAATGCCCTT 


GCAGTCACCC 


TTCCTGAAGT 


TTGAACGACT 


CTGCTGTTTT 


AAACAGTTTA 


3869 


AGCAAATGGT 


ATATCATCTT 


CCGTTTACTA 


TGTAGCTTAA 


CTGCAGGCTT 


ACGCTTTTGA 


3929 


GTCAGCGGCC 


AACTTTATTG 


CCACCTTCAA 


AAGTTTATTA 


TAATGTTGTA 


AATTTTTACT 


3989 


TCTCAAGGTT 


AGCATACTTA 


GGAGTTGCTT 


CACAATTAGG 


ATTCAGGAAA 


GAAAGAACTT 


4049 


CAGTAGGAAC 


TGATTGGAAT 


TTAATGATGC 


AGCATTCAAT 


GGGTACTAAT 


TTCAAAGAAT 


4109 


GATATTACAG 


CAGACACACA 


GCAGTTATCT 


TGATTTTCTA 


GGAATAATTG 


TATGAAGAAT 


4169 


ATGGCTGACA 


ACACGGCCTT 


ACTGCCACTC 


AGCGGAGGCT 


GGACTAATGA 


ACACCCTACC 


4229 


PTTPTTTPPT 
o i I \j i I i Uu 1 


TTPPTPTPAP 


ATTTPATP AC 


CGTTTTGTAG 


GTAACGAGAA 


AATTGACTTG 


4zo9 


CATTTGCATT 


ACAAGGAGGA 


GAAACTGGCA 


AAGGGGATGA 


TGGTGGAAGT 


TTTGTTCTGT 


4349 


CTAATGAAGT 


GAAAAATGAA 


AATGCTAGAG 


TTTTGTGCAA 


CATAATAGTA 


GCAGTAAAAA 


4409 


CCAAGTGAAA 


AGTCTTTCCA 


AAACTGTGTT 


AAGAGGGCAT 


CTGCTGGGAA 


ACGATTTGAG 


4469 


GAGAAGGTAC 


TAAATTGCTT 


GGTATTTTCC 


GTAG GA ACC CCA GAG CGA AAT ACA 


4523 



Gly Thr Pro Glu Arg Asn Thr 
115 



GTT TGC AAA AGA TGT CCA GAT GGG TTC TTC TCA AAT GAG ACG TCA TCT 4571 

Val Cys Lys Arg Cys Pro Asp Gly Phe Phe Ser Asn Glu Thr Ser Ser 

120 125 130 135 

AAA GCA CCC TGT AGA AAA CAC ACA AAT TGC AGT GTC TTT GGT CTC CTG 4619 

Lys Ala Pro Cys Arg Lys His Thr Asn Cys Ser Val Phe Gly Leu Leu 

140 145 150 

CTA ACT CAG AAA GGA AAT GCA ACA CAC GAC AAC ATA TGT TCC GGA AAC 4667 
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Leu Thr Gin Lys Gly Asn Ala Thr His Asp Asn He Cys Ser Gly Asn 
155 160 165 

AGT GAA TCA ACT CAA AAA TGT GGA ATA G GTAATTACAT TCCAAAATAC 4715 
Ser Glu Ser Thr Gin Lys Cys Gly He 
170 175 

GTCTTTGTAC GATTTTGTAG TATCATCTCT CTCTCTGAGT TGAACACAAG GCCTCCAGCC 4775 
ACATTCTTGG TCAAACTTAC ATTTTCCCTT TCTTGAATCT TAACCAGCTA AGGCTACTCT 4835 
CGATGCATTA CTGCTAAAGC TACCACTCAG AATCTCTCAA AAACTCATCT TCTCACAGAT 4895 
AACACCTCAA AGCTTGATTT TCTCTCCTTT CACACTGAAA TCAAATCTTG CCCATAGGCA 4955 
AAGGGCAGTG TCAAGTTTGC CACTGAGATG AAATTAGGAG AGTCCAAACT GTAGAATTCA 5015 
CGTTGTGTGT TATTACTTTC ACGAATGTCT GTATTATTAA CTAAAGTATA TATTGGCAAC 5075 
TAAGAAGCAA AGTGATATAA ACATGATGAC AAATTAGGCC AGGCATGGTG GCTTACTCCT 5135 
ATAATCCCAA CATTTTGGGG GGCCAAGGTA GGCAGATCAC TTGAGGTCAG GATTTCAAGA 5195 
CCAGCCTGAC CAACATGGTG AAACCTTGTC TCTACTAAAA ATACAAAAAT TAGCTGGGCA 5255 
TGGTAGCAGG CACTTCTAGT ACCAGCTACT CAGGGCTGAG GCAGGAGAAT CGCTTGAACC 5315 
CAGGAGATGG AGGTTGCAGT GAGCTGAGAT TGTACCACTG CACTCCAGTC TGGGCAACAG 5375 
AGCAAGATTT CATCACACAC ACACACACAC ACACACACAC ACACATTAGA AATGTGTACT 5435 
TGGCTTTGTT ACCTATGGTA TTAGTGCATC TATTGCATGG AACTTCCAAG CTACTCTGGT 5495 
TGTGTTAAGC TCTTCATTGG GTACAGGTCA CTAGTATTAA GTTCAGGTTA TTCGGATGCA 5555 
TTCCACGGTA GTGATGACAA TTCATCAGGC TAGTGTGTGT GTTCACCTTG TCACTCCCAC 5615 
CACTAGACTA ATCTCAGACC TTCACTCAAA GACACATTAC ACTAAAGATG ATTTGCTTTT 5675 
TTGTGTTTAA TCAAGCAATG GTATAAACCA GCTTGACTCT CCCCAAACAG TTTTTCGTAC 5735 
TACAAAGAAG TTTATGAAGC AGAGAAATGT GAATTGATAT ATATATGAGA TTCTAACCCA 5795 
GTTCCAGCAT TGTTTCATTG TGTAATTGAA ATCATAGACA AGCCATTTTA GCCTTTGCTT 5855 
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TCTTATCTAA 


AAAAAAAAAA 


AAAAAAATGA 


AGGAAGGGGT 


ATTAAAAGGA 


GTGATCAAAT 


5915 


TTTAACATTC 


TCTTTAATTA 


ATTCATTTTT 


AATTTTACTT 


TTTTTCATTT 


ATTGTGCACT 


5975 


TACTATGTGG 


TACTGTGCTA 


TAGAGGCTTT 


AACATTTATA 


AAAACACTGT 


GAAAGTTGCT 


6035 


TCAGATGAAT 


ATAGGTAGTA 


GAACGGCAGA 


ACTAGTATTC 


AAAGCCAGGT 


CTGATGAATC 


6095 


CAAAAACAAA 


CACCCATTAC 


TCCCATTTTC 


TGGGACATAC 


TTACTCTACC 


CAGATGCTCT 


6155 


GGGCTTTGTA 


ATGCCTATGT 


AAATAACATA 


GTTTTATGTT 


TGGTTATTTT 


CCTATGTAAT 


6215 


GTCTACTTAT 


ATATCTGTAT 


CTATCTCTTG 


CTTTGTTTCC 


AAAGGTAAAC 


TATGTGTCTA 


6275 


AATGTGGGCA 


AAAAATAACA 


CACTATTCCA 


AATTACTGTT 


CAAATTCCTT 


TAAGTCAGTG 


6335 


ATAATTATTT 


GTTTTGACAT 


TAATCATGAA 


GTTCCCTGTG 


GGTACTAGGT 


AAACCTTTAA 


6395 


TAGAATGTTA 


ATGTTTGTAT 


TCATTATAAG 


AATTTTTGGC 


TGTTACTTAT 


TTACAACAAT 


6455 


ATTTPAPTPT 


AATTAP,APAT 


TTAPTAAAPT 


TTCTCTTGAA 


AACAATGCCC 


AAAAAAGAAC 


DO ID 


ATTAGAAGAC 


ACGTAAGCTC 


AGTTGGTCTC 


TGCCACTAAG 


ACCAGCCAAC 


AGAAGCTTGA 


6575 


TTTTATTCAA 


ACTTTGCATT 


TTAGCATATT 


TTATCTTGGA 


AAATTCAATT 


GTGTTGGTTT 


6635 


TTTGTTTTTG 


TTTGTATTGA 


ATAGACTCTC 


AGAAATCCAA 


TTGTTGAGTA 


AATCTTCTGG 


6695 


GTTTTCTAAC 


CTTTCTTTAG 


AT GTT ACC 


CTG TGT GAG GAG GCA TTC TTC AGG 


6747 



Asp Val Thr Leu Cys Glu Glu Ala Phe Phe Arg 
180 185 



TTT GCT GTT CCT ACA AAG TTT ACG CCT AAC TGG CTT ACT GTC TTG GTA 6795 
Phe Ala Val Pro Thr Lys Phe Thr Pro Asn Trp Leu Ser Val Leu Val 
190 195 200 

GAC AAT TTG CCT GGC ACC AAA GTA AAC GCA GAG AGT GTA GAG AGG ATA 6843 
Asp Asn Leu Pro Gly Thr Lys Val Asn Ala Glu Ser Val Glu Arg lie 
205 210 215 
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AAA CGG CAA CAC AGC TCA CAA GAA CAG ACT TTC CAG CTG CTG AAG TTA 6891 
Lys Arg Gin His Ser Ser Gin Glu Gin Thr Phe Gin Leu Leu Lys Leu 
220 225 230 235 

TGG AAA CAT CAA AAC AAA GAC CAA GAT ATA GTC AAG AAG ATC ATC CAA G 6940 
Trp Lys His Gin Asn Lys Asp Gin Asp He Val Lys Lys He He Gin 
240 245 250 

GTATGATAAT CTAAAATAAA AAGATCAATC AGAAATCAAA GACACCTATT TATCATAAAC 7000 
CAGGAACAAG ACTGCATGTA TGTTTAGTTG TGTGGATCTT GTTTCCCTGT TGGAATCATT 7060 
GTTGGACTGA AAAAGTTTCC ACCTGATAAT GTAGATGTGA TTCCACAAAC AGTTATACAA 7120 
GGTTTTGTTC TCACCCCTGC TCCCCAGTTT CCTTGTAAAG TATGTTGAAC ACTCTAAGAG 7180 
AAGAGAAATG CATTTGAAGG CAGGGCTGTA TCTCAGGGAG TCGCTTCCAG ATCCCTTAAC 7240 
GCTTCTGTAA GCAGCCCCTC TAGACCACCA AGGAGAAGCT CTATAACCAC TTTGTATCTT 7300 
ACATTGCACC TCTACCAAGA AGCTCTGTTG TATTTACTTG GTAATTCTCT CCAGGTAGGC 7360 
TTTTCGTAGC TTACAAATAT GTTCTTATTA ATCCTCATGA TATGGCCTGC ATTAAAATTA 7420 
TTTTAATGGC ATATGTTATG AGAATTAATG AGATAAAATC TGAAAAGTGT TTGAGCCTCT 7480 
TGTAGGAAAA AGCTAGTTAC AGCAAAATGT TCTCACATCT TATAAGTTTA TATAAAGATT 7540 
CTCCTTTAGA AATGGTGTGA GAGAGAAACA GAGAGAGATA GGGAGAGAAG TGTGAAAGAA 7600 
TCTGAAGAAA AGGAGTTTCA TCCAGTGTGG ACTGTAAGCT TTACGACACA TGATGGAAAG 7660 
AGTTCTGACT TCAGTAAGCA TTGGGAGGAC ATGCTAGAAG AAAAAGGAAG AAGAGTTTCC 7720 
ATAATGCAGA CAGGGTCAGT GAGAAATTCA TTCAGGTCCT CACCAGTAGT TAAATGACTG 7780 
TATAGTCTTG CACTACCCTA AAAAACTTCA AGTATCTGAA ACCGGGGCAA CAGATTTTAG 7840 
GAGACCAACG TCTTTGAGAG CTGATTGCTT TTGCTTATGC AAAGAGTAAA CTTTTATGTT 7900 
TTGAGCAAAC CAAAAGTATT CTTTGAACGT ATAATTAGCC CTGAAGCCGA AAGAAAAGAG 7960 
AAAATCAGAG ACCGTTAGAA TTGGAAGCAA CCAAATTCCC TATTTTATAA ATGAGGACAT 8020 
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TTTAACCCAG AAAGATGAAC CGATTTGGCT TAGGGCTCAC AGATACTAAG TGACTCATGT 8080 
CATTAATAGA AATGTTAGTT CCTCCCTCTT AGGTTTGTAC CCTAGCTTAT TACTGAAATA 8140 
TTCTCTAGGC TGTGTGTCTC CTTTAGTTCC TCGACCTCAT GTCTTTGAGT TTTCAGATAT 8200 
CCTCCTCATG GAGGTAGTCC TCTGGTGCTA TGTGTATTCT TTAAAGGCTA GTTACGGCAA 8260 
TTAACTTATC AACTAGCGCC TACTAATGAA ACTTTGTATT ACAAAGTAGC TAACTTGAAT 8320 
ACTTTCCTTT TTTTCTGAAA TGTTATGGTG GTAATTTCTC AAACTTTTTC TTAGAAAACT 8380 
GAGAGTGATG TGTCTTATTT TCTACTGTTA ATTTTCAAAA TTAGGAGCTT CTTCCAAAGT 8440 
TTTGTTGGAT GCCAAAAATA TATAGCATAT TATCTTATTA TAACAAAAAA TATTTATCTC 8500 
AGTTCTTAGA AATAAATGGT GTCACTTAAC TCCCTCTCAA AAGAAAAGGT TATCATTGAA 8560 
ATATAATTAT GAAATTCTGC AAGAACCTTT TGCCTCACGC TTGTTTTATG ATGGCATTGG 8620 
ATGAATATAA ATGATGTGAA CACTTATCTG GGCTTTTGCT TTATGCAG AT ATT GAC 8676 

Asp He Asp 

CTC TGT GAA AAC AGC GTG CAG CGG CAC ATT GGA CAT GCT AAC CTC ACC 8724 
Leu Cys Glu Asn Ser Val Gin Arg His He Gly His Ala Asn Leu Thr 
255 260 265 270 

TTC GAG CAG CTT CGT AGC TTG ATG GAA AGC TTA CCG GGA AAG AAA GTG 8772 
Phe Glu Gin Leu Arg Ser Leu Met Glu Ser Leu Pro Gly Lys Lys Val 
275 280 285 

GGA GCA GAA GAC ATT GAA AAA ACA ATA AAG GCA TGC AAA CCC ACT GAC 8820 
Gly Ala Glu Asp He Glu Lys Thr He Lys Ala Cys Lys Pro Ser Asp 
290 295 300 

CAG ATC CTG AAG CTG CTC ACT TTG TGG CGA ATA AAA AAT GGC GAC CAA 8868 
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Gin lie Leu Lys Leu Leu Ser Leu Trp Arg lie Lys Asn Gly Asp Gin 
305 310 315 

GAC ACC TTG AAG GGC CTA ATG CAC GCA CTA AAG CAC TCA AAG ACG TAC 8916 
Asp Thr Leu Lys Gly Leu Met His Ala Leu Lys His Ser Lys Thr Tyr 
320 325 330 

CAC TTT CCC AAA ACT GTC ACT CAG AGT CTA AAG AAG ACC ATC AGG TTC 8964 
His Phe Pro Lys Thr Val Thr Gin Ser Leu Lys Lys Thr He Arg Phe 
335 340 345 350 

CTT CAC AGC TTC ACA ATG TAC AAA TTG TAT CAG AAG TTA TTT TTA GAA 9012 
Leu His Ser Phe Thr Met Tyr Lys Leu Tyr Gin Lys Leu Phe Leu Glu 
355 360 365 

ATG ATA GGT AAC CAG GTC CAA TCA GTA AAA ATA AGC TGC TTA 9054 
Met He Gly Asn Gin Val Gin Ser Val Lys He Ser Cys Leu 
370 375 380 

TAACTGGAAA TGGCCATTGA GCTGTTTCCT CACAATTGGC GAGATCCCAT GGATGAGTAA 9114 
ACTGTTTCTC AGGCACTTGA GGCTTTCAGT GATATCTTTC TCATTACCAG TGACTAATTT 9174 
TGCCACAGGG TACTAAAAGA AACTATGATG TGGAGAAAGG ACTAACATCT CCTCCAATAA 9234 
ACCCCAAATG GTTAATCCAA CTGTCAGATC TGGATCGTTA TCTACTGACT ATATTTTCCC 9294 
TTATTACTGC TTGCAGTAAT TCAACTGGAA ATTAAAAAAA AAAAACTAGA CTCCACTGGG 9354 
CCTTACTAAA TATGGGAATG TCTAACTTAA ATAGCTTTGG GATTCCAGCT ATGCTAGAGG 9414 
CTTTTATTAG AAAGCCATAT TTTTTTCTGT AAAAGTTACT AATATATCTG TAACACTATT 9474 
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ACAGTATTGC TATTTATATT CATTCAGATA TAAGATTTGG ACATATTATC ATCCTATAAA 9534 
GAAACGGTAT GACTTAATTT TAGAAAGAAA ATTATATTCT GTTTATTATG ACAAATGAAA 9594 
GAGAAAATAT ATATTTTTAA TGGAAAGTTT GTAGCATTTT TCTAATAGGT ACTGCCATAT 9654 
TTTTCTGTGT GGAGTATTTT TATAATTTTA TCTGTATAAG CTGTAATATC ATTTTATAGA 9714 
AAATGCATTA TTTAGTCAAT TGTTTAATGT TGGAAAACAT ATGAAATATA AATTATCTGA 9774 
ATATTAGATG CTCTGAGAAA TTGAATGTAC CTTATTTAAA AGATTTTATG GTTTTATAAC 9834 
TATATAAATG ACATTATTAA AGTTTTCAAA TTATTTTTTA TTGCTTTCTC TGTTGCTTTT 9894 
ATTT 9898 

ffi^JS^- : 3 

U?IJ <Dfi£ : 4 0 1 

m<DM. : 1 

K^IJOaiS : 161 

Met Asn Asn Leu Leu Cys Cys Ala Leu Val Phe Leu Asp He Ser 

-20 -15 -10 

He Lys Trp Thr Thr Gin Glu Thr Phe Pro Pro Lys Tyr Leu His 

-5 1 5 

Tyr Asp Glu Glu Thr Ser His Gin Leu Leu Cys Asp Lys Cys Pro 
10 15 20 

Pro Gly Thr Tyr Leu Lys Gin His Cys Thr Ala Lys Trp Lys Thr 
25 30 35 

Val Cys Ala Pro Cys Pro Asp His Tyr Tyr Thr Asp Ser Trp His 
40 45 50 
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Thr Ser 
55 

Gin Tyr 
70 

Glu Cys 
85 

His Arg 
100 

Pro Glu 
115 

Ser Asn 
130 

Cys Ser 
145 

His Asp 
160 

Gly lie 
175 

Val Pro 
190 

Asn Leu 

205 

Lys Arg 
220 

Leu Trp 

235 



Asp Glu 
Val Lys 
Lys Glu 
Ser Cys 
Arg Asn 
Glu Thr 
Val Phe 
Asn He 
Asp Val 
Thr Lys 
Pro Gly 
Gin His 
Lys His 



Cys Leu 
60 

Gin Glu 
75 

Gly Arg 
90 

Pro Pro 

105 
Thr Val 

120 
Ser Ser 

135 
Gly Leu 

150 
Cys Ser 

165 
Thr Leu 

180 
Phe Thr 

195 
Thr Lys 

210 
Ser Ser 

225 
Gin Asn 

240 



Tyr Cys 
Cys Asn 
Tyr Leu 
Gly Phe 
Cys Lys 
Lys Ala 
Leu Leu 
Gly Asn 
Cys Glu 
Pro Asn 
Val Asn 
Gin Glu 
Lys Asp 



7 

Ser Pro 
Arg Thr 
Glu He 
Gly Val 
Arg Cys 
Pro Cys 
Thr Gin 
Ser Glu 
Glu Ala 
Trp Leu 
Ala Glu 
Gin Thr 
Gin Asp 



Val Cys 
65 

His Asn 
80 

Glu Phe 
95 

Val Gin 
110 

Pro Asp 
125 

Arg Lys 
140 

Lys Gly 
155 

Ser Thr 
170 

Phe Phe 
185 

Ser Val 
200 

Ser Val 
215 

Phe Gin 
230 

He Val 
245 



Lys Glu Leu 
Arg Val Cys 
Cys Leu Lys 
Ala Gly Thr 
Gly Phe Phe 
His Thr Asn 
Asn Ala Thr 
Gin Lys Cys 
Arg Phe Ala 
Leu Val Asp 
Glu Arg He 
Leu Leu Lys 
Lys Lys He 
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He Gin Asp He Asp Leu Cys Glu Asn Ser Val Gin Arg His lie 

250 255 260 

Gly His Ala Asn Leu Thr Phe Glu Gin Leu Arg Ser Leu Met Glu 

265 270 275 

Ser Leu Pro Gly Lys Lys Val Gly Ala Glu Asp He Glu Lys Thr 

280 285 290 

He Lys Ala Cys Lys Pro Ser Asp Gin He Leu Lys Leu Leu Ser 

295 300 305 

Leu Trp Arg He Lys Asn Gly Asp Gin Asp Thr Leu Lys Gly Leu 

310 315 320 

Met His Ala Leu Lys His Ser Lys Thr Tyr His Phe Pro Lys Thr 

325 330 335 

Val Thr Gin Ser Leu Lys Lys Thr lie Arg Phe Leu His Ser Phe 

340 345 350 

Thr Met Tyr Lys Leu Tyr Gin Lys Leu Phe Leu Glu Met He Gly 

355 360 365 

Asn Gin Val Gin Ser Val Lys He Ser Cys Leu 

370 375 380 



mzm^- : 4 

!2?IJcDfi£ : 1 2 0 6 

mm cr>m : mm 
m&m. ■. i 

ffizwmm : c DNA 

mm : 
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ATGAACAACT 


TGCTGTGCTG 


CGCGCTCGTG 


TTTCTGGACA 


TCTCCATTAA 


GTGGACCACC 


60 


CAGGAAACGT 


TTCCTCCAAA 


GTACCTTCAT 


TATGACGAAG 


AAACCTCTCA 


TCAGCTGTTG 


120 


TGTGACAAAT 


GTCCTCCTGG 


TACCTACCTA 


AAACAACACT 


GTACAGCAAA 


GTGGAAGACC 


180 


GTGTGCGCCC 


CTTGCCCTGA 


CCACTACTAC 


ACAGACAGCT 


GGCACACCAG 


TGACGAGTGT 


240 


CTATACTGCA 


GCCCCGTGTG 


CAAGGAGCTG 


CAGTACGTCA 


AGCAGGAGTG 


CAATCGCACC 


300 


CACAACCGCG 


TGTGCGAATG 


CAAGGAAGGG 


CGCTACCTTG 


AGATAGAGTT 


CTGCTTGAAA 


360 


CATAGGAGCT 


GCCCTCCTGG 


ATTTGGAGTG 


GTGCAAGCTG 


GAACCCCAGA 


GCGAAATACA 


420 


GTTTGCAAAA 


GATGTCCAGA 


TGGGTTCTTC 


TCAAATGAGA 


CGTCATCTAA 


AGCACCCTGT 


480 


AGAAAACACA 


CAAATTGCAG 


TGTCTTTGGT 


CTCCTGCTAA 


CTCAGAAAGG 


AAATGCAACA 


540 


CACGACAACA 


TATGTTCCGG 


AAACAGTGAA 


TCAACTCAAA 


AATGTGGAAT 


AGATGTTACC 


600 


CTGTGTGAGG 


AGGCATTCTT 


CAGGTTTGCT 


GTTCCTACAA 


AGTTTACGCC 


TAACTGGCTT 


660 


AGTGTCTTGG 


TAGACAATTT 


GCCTGGCACC 


AAAGTAAACG 


CAGAGAGTGT 


AGAGAGGATA 


720 


AAACGGCAAC 


ACAGCTCACA 


AGAACAGACT 


TTCCAGCTGC 


TGAAGTTATG 


GAAACATCAA 


780 


AACAAAGACC 


AAGATATAGT 


CAAGAAGATC 


ATCCAAGATA 


TTGACCTCTG 


TGAAAACAGC 


840 


GTGCAGCGGC 


ACATTGGACA 


TGCTAACCTC 


ACCTTCGAGC 


AGCTTCGTAG 


CTTGATGGAA 


900 


AGCTTACCGG 


GAAAGAAAGT 


GGGAGCAGAA 


GACATTGAAA 


AAACAATAAA 


GGCATGCAAA 


960 


CCCAGTGACC 


AGATCCTGAA 


GCTGCTCAGT 


TTGTGGCGAA 


TAAAAAATGG 


CGACCAAGAC 


1020 


ACCTTGAAGG 


GCCTAATGCA 


CGCACTAAAG 


CACTCAAAGA 


CGTACCACTT 


TCCCAAAACT 


1080 


GTCACTCAGA 


GTCTAAAGAA 


GACCATCAGG 


TTCCTTCACA 


GCTTCACAAT 


GTACAAATTG 


1140 


TATCAGAAGT 


TATTTTTAGA 


AATGATAGGT 


AACCAGGTCC 


AATCAGTAAA 


AATAAGCTGC 


1200 



TTATAA 1206 
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